SEQUENCE LISTING 



<110> CNRS 



<120> STARCH GRANULES CONTAINING A RECOMBINANT POLYPEPTIDE 

OF INTEREST, A METHOD OF OBTAINING THEM AND THEIR USES 

<130> WOB 99 AB CNR AMYL 

<140> US 09/980,771 
<141> 2001-11-15 



FR 99/06494 
1999-05-21 




<150> 
<151> 

<160> 



<170> Patentln version 3.1 

<210> 1 

<211> 3117 

<212> DNA 

<213> Chlamydomonas reinhardtii 



<400> 1 

gccaggcgga 

tcatcaatgc 

ctcgtggctc 

cttgcgcgct 

gcctgggcga 

tgaccattgc 

acatcatggg 

ggattgacca 

gcccccgctc" 

ccgctattga 

ccaacgactg 

gccagttcac 

tgtgggagga 

tctcggacgg 

ccccgctgac 

ccgacaagct 

gcggtgtgga 

tggacattga 

acagcgtcta 

ctgtggaccc 

tggacatcat 

tcctgggtac 

agggccgcgc 

gcgccgactt 

ccatgcacta 

aggagggcgt 

acgccgacgc 

accccgagat 

agtgggaggg 

aggaggagat 

acgcccccaa 

ccaaggtcgg 

cctcgcccgc 

ccaagcccaa 

agaacggcgc 

ccaagcccct 



gcgtatggct 
cgcgtcgttc 
cgcacgcaag 
ggacatcgtg 
tgtgactggt 
ccctcgctac 
cgagaaggtc 
cccctggttc 
cggcgctgac 
ggctgcccgc 
gcactccgcc 
caaggccaag 
ggctttcaag 
ctatgccaag 
gggaaagacc 
ggtgactgtg 
gctggacacc 
ggagtggaac 
cgccggcaag 
caccgccccc 
cctggccgcc 
cggcaaggcc 
caagggcgtg 
catgctggtg 
cggtaccgtg 
caccggcttc 
cctggccgcc 
ggtggccaac 
cctgctggag 
caaggtgccc 
caccctgaag 
caccaccgcc 
cgccgccacc 
gaccgctggc 
tgcctccaac 
ggtctccgcc 



gttgcctcta 
ggtgtcaaga 
tccacctcgc 
atggttgctg 
ggcctgccta 
gaccagtacg 
cgctacttcc 
ctggccaagg 
tacctggaca 
gtgctgccct 
ctggtgcccg 
tcggtgctgg 
gacacgaagc 
gtttacactg 
tacaagaaga 
tcgcccaact 
gtcatccgcg 
cccaagaccg 
gccgccgcca 
ctgttcgcct 
ctgcccaaga 
gcctacgaga 
gtcaagttct 
ccct cgcgct 
cccgtggtag 
cacatgggcg 
accgtgcgcc 
tgcatcagcc 
gaggtggtgt 
gttgccgaga 
cccgtgtccg 
cccgccatgg 
cccaaggtga 
ctcaagctgg 
ggcaacggca 
gccacccgca 



ccagccgccc 
agaccgcgaa 
gctcggctgt 
ctgaggtcgc 
ttgagctggt 
ctgacgcctg 
actccatcaa 
tctggggcaa 
accacaagcg 
tcggccccgg 
tcctgctgaa 
ctatccacaa 
tgccccaggc 
aggccacccc 
tcaactggct 
acgcgaccga 
ccaagggcat 
acaagttcct 
aggaggccct 
tcatcggccg 
tcctggccac 
agctggtgaa 
cggcgcccct 
tcgagccctg 
cctccaccgg 
ccctgaaccc 
gtgccagcga 
aggacctgtc 
acggcaaggg 
agatccccgg 
cctccgtgga 
gcgcgtggcg 
ccacctacaa 
ccggtgaggc 
acggtgcctc 
agtccgccta 



cagcagcgcg 
ccagctgctg 
tactggtgcc 
cccttggtcc 
caagcgcggc 
ggacacctcg 
gaagggcgtg 
gaccggctcc 
cttcgccctg 
cgaggactgc 
ggacgagtac 
catcgccttc 
cgcctttgac 
catggaggag 
gaagggtggc 
gatcgctgcc 
tgagggcatt 
gtctgcgccc 
gcaggccgag 
cctggaggag 
ccccaaggtg 
cgccatcggc 
ggcgcacatg 
cggcctgatc 
cggcctggtc 
cgacaagctg 
ggtgtttgcg 
ctggtccaag 
cggcgtggcc 
cgacctgccc 
gggcaacggc 
cgcgaccacc 
gcccgccctg 
ctccaccacc 
ggcctccaag 
aagcggcagt 



cgtcctatcg 
cgtgagcttg 
actggtgcca 
aagacgggcg 
caccgcgtca 
gtggtcgtgg 
caccgcgtgt 
aagctgtacg 
ttctgcaagg 
gtcttcgtgg 
cagcccaagg 
cagggccgca 
aagctggcct 
gacgagaagc 
attatcgccg 
gatgccgccg 
gtgaacggca 
tqcgaccaga 
ctgggcctgc 
cagaagggtg 
cagatcgcca 
accaagtaca 
ctcaccgccg 
cagctgcacg 
gacaccgtca 
gacgaggctg 
ggcggccgct 
cccgcccaga 
accgccaaga 
gccgtgtcct 
gccgccgcgc 
ccctcgggcc 
cccgccaccg 
tcgacctcgg 
acctcggctg 
agccgcagag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 



gcggcgacag 
gcggccgcca 
gagcatggca 
tgaagtcggt 
gtttcggttt 
tgcgcggcgc 
tcggcggcaa 
gagagtttgc 
cctccggcaa 
ggatacataa 
ggttttgttg 
cttgggctca 
ggctcgaagc 
cttgactgga 
acccagtagt 
tgagagagta 



catgagcggc 
ccggcgagga 
ggcggaaacg 
tttgcgagac 
tgcccaagtc 
ggcgaccacg 
gagggcgggt 
cattttgtgg 
cggagaagtg 
gagcgtgtgg 
aggtgcagac 
tttttggtgc 
gagggtttgt 
ggtacacaaa 
gtggaaagta 
acaaagatgg 



tcgaccaaag 
gcaggcttgc 
tgtgtactga 
cggagaaacg 
caaaagaaga 
ccggcccgtg 
ctgtacactc 
gacgtgagcg 
ggaggcgctg 
aatggtggta 
ggcaccgtcg 
ccgtcgatga 
ggaagtggag 
gtggagcgtg 
gttggacctg 
cacctgaaaa 



ctgtggcagg 
ggcagcgagg 
aatgtggtgc 
ccggtttggt 
gtaacgaaac 
cgcagcctgt 
catccattcc 
gcgggacggc 
tagcccggtg 
aaagaggagg 
gcgtcaaagg 
tgagagattg 
cgaggagggt 
cgacggcacg 
aattctttga 
aaaaaaaaaa 



aacggctgta 
gcgatgagct 
atgagagtgt 
tttgtagtgc 
tgtagcagta 
cctgccctca 
aggatttttg 
cgcgccgggc 
accccccaat 
ggcctgggtc 
ccctcgcaag 
gccagcggtt 
tggagaaaga 
gaggcattgg 
gagtaccgcg 
aaaaaaaaaa 



gcagcggcag 
tagcggccgt 
cgtgctgtaa 
agggcctgtg 
gcagagcact 
gccttgtgat 
caggctgcct 
tctcctaccg 
gtagaggatg 
gcccctcgat 
gcccgggtgc 
ttttgaggct 
ggcggacatg 
cggactattg 
cattaatccg 
aaaaaaa 



2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3117 



<210> 2 

<211> 2124 

<212> DNA 

<213> artificial sequence 



<220> 
<223> 



fragment of the complete sequence of cDNA coding 
for the GBSSI of Chlamydomonas reinhardtii 



<220> 
<221> 
<222> 
<223> 



CDS 
(1) . 



(2124) 



<4O0> 2 

atg get gtt gec tct acc age cgc ccc age age gcg cgt cct ate gtc 

Met Ala Val Ala Ser Thr Ser Arg Pro Ser Ser Ala Arg Pro lie Val 
1 5 10 15 



48 



ate aat gee gcg teg ttc ggt gtc aag aag acc gcg aac cag ctg ctg 
lie Asn Ala Ala Ser Phe Gly Val Lys Lys Thr Ala Asn Gin Leu Leu 
20 25 30 



96 



cgt gag ctt get cgt ggc tec gca cgc aag tec acc teg cgc teg get 
Arg Glu Leu Ala Arg Gly Ser Ala Arg Lys Ser Thr Ser Arg Sjer Ala 
35 40 45 



144 



gtt act ggt gee act ggt gec act tgc gcg ctg gac ate gtg atg gtt 
Val Thr Gly Ala Thr Gly Ala Thr Cys Ala Leu Asp lie Val Met Val 
50 55 60 



192 



get get gag gtc gee cct tgg tec aag acg ggc ggc ctg ggc gat gtg 
Ala Ala Glu Val Ala Pro Trp Ser Lys Thr Gly Gly Leu Gly Asp Val 
65 70 75 80 



240 



act ggt ggc ctg cct att gag ctg gtc aag cgc ggc cac cgc gtc atg 
Thr Gly Gly Leu Pro lie Glu Leu Val Lys Arg Gly His Arg Val Met 
85 90 95 



288 



acc att gec cct cgc tac gac cag tac get gac gee tgg gac acc teg 
Thr lie Ala Pro Arg Tyr Asp Gin Tyr Ala Asp Ala Trp Asp Thr Ser 
100 105 110 



336 



■3 



gtg gtc gtg 
Val Val Val 
115 

aag aag ggc 
Lys Lys Gly 
130 

aag gtc tgg 
Lys Val Trp 
145 

get gac tac 
Ala Asp Tyr 



get att gag 
Ala He Glu 



gtc ttc gtg 
Val Phe Val 
195 

aag gac gag 
Lys Asp Glu 
210 

ctg get ate 
Leu Ala He 
225 

ttc aag gac 
Phe Lys Asp 



teg gac ggc 
Ser Asp Gly 



gac gag aag 
Asp Glu Lys 
275 

ctg aag ggt 
Leu Lys Gly 
290 

aac tac gcg 
Asn Tyr Ala 
305 

gac acc gtc 
Asp Thr Val 



gac att gag 
Asp He Glu 



gac ate atg 
Asp He Met 



gtg cac cgc 
Val His Arg 



ggc aag acc 
Gly Lys Thr 
150 

ctg gac aac 
Leu Asp Asn 
165 

get gee cgc 
Ala Ala Arg 
180 

gee aac gac 
Ala Asn Asp 



tac cag ccc 
Tyr Gin Pro 



cac aac ate 
His Asn He 
230 

acg aag ctg 
Thr Lys Leu 
245 

tat gee aag 
Tyr Ala Lys 
260 

ccc ccg ctg 
Pro Pro Leu 



ggc att ate 
Gly He He 



acc gag ate 
Thr Glu He 
310 

ate cgc gee 
He Arg Ala 
325 

gag tgg aac 
Glu Trp Asn 
340 



ggc gag aag 
Gly Glu Lys 
120 

gtg tgg att 
Val Trp He 
135 

ggc tec aag 
Gly Ser Lys 



cac aag cgc 
His Lys Arg 



gtg ctg ccc 
Val Leu Pro 
185 

tgg cac tec 
Trp His Ser 
200 

aag ggc cag 
Lys Gly Gin 
215 

gee ttc cag 
Ala Phe Gin 



ccc cag gee 
Pro Gin Ala 



gtt tac act 
Val Tyr Thr 
265 

acg gga aag 
Thr Gly Lys 
280 

gee gee gac 
Ala Ala Asp 
295 

get gee gat 
Ala Ala Asp 



aag ggc att 
Lys Gly lie 



ccc aag acc 
Pro Lys Thr 
345 



gtc cgc tac 
Val Arg Tyr 



gac cac ccc 
Asp His Pro 
140 

ctg tac ggc 
Leu Tyr Gly 
155 

ttc gee ctg 
Phe Ala Leu 
170 

ttc ggc ccc 
Phe Gly Pro 



gee ctg gtg 
Ala Leu Val 



ttc acc aag 
Phe Thr Lys 
220 

ggc cgc atg 
Gly Arg Met 
235 

gee ttt gac 
Ala Phe Asp 
250 

gag gee acc 
Glu Ala Thr 



acc tac aag 
Thr Tyr Lys 



aag ctg gtg 
Lys Leu Val 
300 

gee gee ggc 
Ala Ala Gly 
315 

gag ggc att 
Glu Gly He 
330 

gac aag ttc 
Asp Lys Phe 



ttc cac tec 
Phe His Ser 
125 

tgg ttc ctg 
Trp Phe Leu 



ccc cgc tec 
Pro Arg Ser 



ttc tgc aag 
Phe Cys Lys 
175 

ggc gag gac 
Gly Glu Asp 
190 

ccc gtc ctg 
Pro Val Leu 
205 

gee aag teg 
Ala Lys Ser 



tgg gag gag 
Trp Glu Glu 



aag ctg gee 
Lys Leu Ala 
255 

ccc atg gag 
Pro Met Glu 
270 

aag ate aac 
Lys He Asn 
285 

.( 

act gtg teg 
Thr Val Ser 



ggt gtg gag 
Gly Val Glu 



gtg aac ggc 
Val Asn Gly 
335 

ctg tct gcg 
Leu Ser Ala 
350 



ate 384 
He 



gee 432 
Ala 



ggc 480 

Gly 

160 

gee 528 
Ala 



tgc 576 
Cys 



ctg 624 
Leu 



gtg 672 
Val 



get 720 

Ala 

240 

ttc 768 
Phe 



gag 816 
Glu 



tgg 864 
Trp 



ccc 912 
Pro 



ctg 960 

Leu 

320 

atg 1008 
Met 



ccc 1056 
Pro 



.4 



tac gac cag aac age gtc tac gec ggc aag gec gec gec aag gag gec 1104 
Tyr Asp Gin Asn Ser Val Tyr Ala Gly Lys Ala Ala Ala Lys Glu Ala 
355 360 365 

ctg cag gec gag ctg ggc ctg cct gtg gac ccc acc gec ccc ctg ttc 1152 
Leu Gin Ala Glu Leu Gly Leu Pro Val Asp Pro Thr Ala Pro Leu Phe 
370 375 380 

gec ttc ate ggc cgc ctg gag gag cag aag ggt gtg gac ate ate ctg 1200 
Ala Phe lie Gly Arg Leu Glu Glu Gin Lys Gly Val Asp lie lie Leu 
385 390 395 400' 

gee gec ctg ccc aag ate ctg gec acc ccc aag gtg cag ate gee ate 1248 
Ala Ala Leu Pro Lys lie Leu Ala Thr Pro Lys Val Gin lie Ala lie 
405 410 415 

ctg ggt acc ggc aag gee gee tac gag aag ctg gtg aac gee ate ggc 1296 
Leu Gly Thr Gly Lys Ala Ala Tyr Glu Lys Leu Val Asn Ala lie Gly 
420 425 430 

acc aag tac aag ggc cgc gee aag ggc gtg gtc aag ttc teg gcg ccc 134 4 

Thr Lys Tyr Lys Gly Arg Ala Lys Gly Val Val Lys Phe Ser Ala Pro 
435 440 445 

ctg gcg cac atg etc acc gec ggc gee gac ttc atg ctg gtg ccc teg 1392 
Leu Ala His Met Leu Thr Ala Gly Ala Asp Phe Met Leu Val Pro Ser 
450 455 460 

cgc ttc gag ccc tgc ggc ctg ate cag ctg cac gee atg cac tac ggt 1440 
Arg Phe Glu Pro Cys Gly Leu lie Gin Leu His Ala Met His Tyr Gly 
465 470 475 480 

acc gtg ccc gtg gta gee tec acc ggc ggc ctg gtc gac acc gtc aag 1488 
Thr Val Pro Val Val Ala Ser Thr- Gly Gly Leu Val Asp Thr Val Lys 
485 490 495 

gag ggc gtc acc ggc ttc cac atg ggc gee ctg aac ccc gac aag ctg 1536 
Glu Gly Val Thr Gly Phe His Met Gly Ala Leu Asn Pro Asp Lys Leu 
500 505 510 

gac gag get gac gee gac gee ctg gee gee acc gtg cgc cgt gec age 1584 
Asp Glu Ala Asp Ala Asp Ala Leu Ala Ala Thr Val Arg Arg Ala Ser 
515 520 525 

gag gtg ttt gcg ggc ggc cgc tac ccc gag atg gtg gec aac tgc ate 1632 
Glu Val Phe Ala Gly Gly Arg Tyr Pro Glu Met Val Ala Asn Cys He 
530 535 540 

age cag gac ctg tec tgg tec aag ccc gec cag aag tgg gag ggc ctg 1680 
Ser Gin Asp Leu Ser Trp Ser Lys Pro Ala Gin Lys Trp Glu Gly Leu 
.545 550 555 560 

ctg gag gag gtg gtg tac ggc aag ggc ggc gtg gec acc gee aag aag 1728 
Leu Glu Glu Val Val Tyr Gly Lys Gly Gly Val Ala Thr Ala Lys Lys 
565 570 575 

gag gag ate aag gtg ccc gtt gee gag aag ate ccc ggc gac ctg ccc 177 6 

Glu Glu He Lys Val Pro Val Ala Glu Lys He Pro Gly Asp Leu Pro 
580 585 590 



gcc gtg tec tac gec ccc aac acc ctg aag ccc gtg tec gec tec gtg 1824 

Ala Val Ser Tyr Ala Pro Asn Thr Leu Lys Pro Val Ser Ala Ser Val 

595 600 605 

gag ggc aac ggc gcc gcc gcg ccc aag gtc ggc acc acc gcc ccc gcc 1872 

Glu Gly Asn Gly Ala Ala Ala Pro Lys Val Gly Thr Thr Ala Pro Ala 

610 615 620 

atg ggc gcg tgg cgc gcg acc acc ccc teg ggc ccc teg ccc gcc gcc 1920 

Met Gly Ala Trp- Arg Ala Thr Thr Pro Ser Gly Pro Ser Pro Ala Ala 

625 630 635 640 

gcc acc ccc aag gtg acc acc tac aag ccc gcc ctg ccc gcc acc gcc 1968 

Ala Thr Pro Lys Val Thr Thr Tyr Lys Pro Ala Leu Pro Ala Thr Ala 

645 650 655 

aag ccc aag acc get ggc etc aag ctg gcc ggt gag gcc tec acc acc 2016 

Lys Pro Lys Thr Ala Gly Leu Lys Leu Ala Gly Glu Ala Ser Thr Thr 

660 665 670 

teg acc teg gag aac ggc get gcc tec aac ggc aac ggc aac ggt gcc 2064 

Ser Thr Ser Glu Asn Gly Ala Ala Ser Asn Gly Asn Gly Asn Gly Ala 

675 680 685 

teg gcc tec aag acc teg get gcc aag ccc ctg gtc tec gcc gcc acc 2112 

Ser Ala Ser Lys Thr Ser Ala Ala Lys Pro Leu Val Ser Ala Ala Thr 

690 695 700 

cgc aag tec gcc 2124 

Arg Lys Ser Ala 

705 



<210> 3 

<211> 708 

<212> PRT 

<213> artificial sequence 



<220> 

<223> fragment of the complete sequence of cDNA coding 
for the GBSSI of Chlamydomonas reinhardtii 

<400> 3 

Met Ala Val Ala Ser Thr Ser Arg Pro Ser Ser Ala Arg Pro fie Val 
15 10 15 

lie Asn Ala Ala Ser Phe Gly Val Lys Lys Thr Ala Asn Gin Leu Leu 
20 25 30 

Arg Glu Leu Ala Arg Gly Ser Ala Arg Lys Ser Thr Ser Arg Ser Ala 
35 40 45 

Val Thr Gly Ala Thr Gly Ala Thr Cys Ala Leu Asp lie Val Met Val 
50 55 60 

Ala Ala Glu Val Ala Pro Trp Ser Lys Thr Gly Gly Leu Gly Asp Val 
65 70 75 80 



Thr Gly Gly Leu Pro lie Glu Leu Val Lys Arg Gly His Arg Val Met 
85 90 95 



•6 



Thr lie Ala Pro 
100 

Val Val Val Asp 
115 

Lys Lys Gly Val 
130 

Lys Val Trp Gly 
145 

Ala Asp Tyr Leu 



Ala He Glu Ala 
180 

Val Phe Val Ala 
195 

Lys Asp Glu Tyr 
210 

Leu Ala He His 
225 

Phe Lys Asp Thr 



Ser Asp Gly Tyr 
260 

Asp Glu Lys Pro 
275 

Leu Lys Gly Gly 
290 

Asn Tyr Ala Thr 
305 

Asp Thr Val lie 



Asp He Glu Glu 
340 

Tyr Asp Gin Asn 
355 

Leu Gin Ala Glu 
370 

Ala Phe He Gly 
385 

Ala Ala Leu Pro 



Arg Tyr Asp Gin 



He Met Gly Glu 
120 

His Arg Val Trp 
135 

Lys Thr Gly Ser 
150 

Asp Asn His Lys 
165 

Ala Arg Val Leu 



Asn Asp Trp His 
200 

Gin Pro Lys Gly 
215 

Asn He Ala Phe 
230 

Lys Leu Pro Gin 
245 

Ala Lys Val Tyr 



Pro Leu Thr Gly 
280 

He He Ala Ala 
295 

Glu He Ala Ala 
310 

Arg Ala. Lys Gly 
325 

Trp Asn Pro Lys 



Ser Val Tyr Ala 
360 

Leu Gly Leu Pro 
375 

Arg Leu Glu Glu 
390 

Lys He Leu Ala 
405 



Tyr Ala Asp Ala 
105 

Lys Val Arg Tyr 



He Asp His Pro 
140 

Lys Leu Tyr Gly 
155 

Arg Phe Ala Leu 
170 

Pro Phe Gly Pro 
185 

Ser Ala Leu Val 



Gin Phe Thr Lys 
220 

Gin Gly Arg Met 
235 

Ala Ala Phe Asp 
250 

Thr Glu Ala Thr 
265 

Lys Thr Tyr Lys 



Asp Lys Leu Val 
300 

Asp Ala Ala Gly 
315 

He Glu Gly He 
330 

Thr Asp Lys Phe 
345 

Gly Lys Ala Ala 



Val Asp Pro Thr 
380 

Gin Lys Gly Val 
395 

Thr Pro Lys Val 
410 



Trp Asp Thr Ser 
110 

Phe His Ser He 
125 

Trp Phe Leu Ala 



Pro Arg Ser Gly 
160 

Phe Cys Lys Ala 
175 

Gly Glu Asp Cys 
190 

Pro Val Leu Leu 
205 

Ala Lys Ser Val 



Trp Glu Glu Ala 
240 

Lys Leu .Ala Phe 
255 

Pro Met Glu Glu 
270 

Lys lie Asn Trp 
285 

Thr Val Ser Pro 



Gly Val Glu Leu 
320 

Val Asn Gly Met 
335 

Leu Ser Ala Pro 
350 

Ala Lys Glu Ala 
365 

Ala Pro Leu Phe 



Asp He He Leu 
400 

Gin He Ala He 
415 



Leu Gly Thr Gly Lys Ala Ala Tyr Glu Lys Leu Val Asn Ala lie Gly 
420 425 430 

Thr Lys Tyr Lys Gly Arg Ala Lys Gly Val Val Lys Phe Ser Ala Pro 
435 440 445 

Leu Ala His Met Leu Thr Ala Gly Ala Asp Phe Met Leu Val Pro Ser 
450 455 460 

Arg Phe Glu Pro Cys Gly Leu lie Gin Leu His Ala Met His Tyr Gly 
465 470 475 480 

Thr Val Pro Val Val Ala Ser Thr Gly Gly Leu Val Asp Thr Val Lys 
485 490 495 

Glu Gly Val Thr Gly Phe His Met Gly Ala Leu Asn Pro Asp Lys Leu 
500 505 510 

Asp Glu Ala Asp Ala Asp Ala Leu Ala Ala Thr Val Arg Arg Ala Ser 
515 520 525 

Glu Val Phe Ala Gly Gly Arg Tyr Pro Glu Met Val Ala Asn Cys lie 
530 535 540 

Ser Gin Asp Leu Ser Trp Ser Lys Pro Ala Gin Lys Trp Glu Gly Leu 
545 550 555 560 

Leu Glu Glu Val Val Tyr Gly Lys Gly Gly Val Ala Thr Ala Lys Lys 
565 570 575 

Glu Glu lie Lys Val Pro Val Ala Glu Lys lie Pro Gly Asp Leu Pro 
580 585 590 

Ala Val Ser Tyr Ala Pro Asn Thr Leu Lys Pro Val Ser Ala Ser T /al 
595 600 605 

Glu Gly Asn Gly Ala Ala Ala Pro Lys Val Gly Thr Thr Ala Pro Ala 
610 615 620 

Met Gly Ala Trp Arg Ala Thr Thr Pro Ser Gly Pro Ser Pro Ala Ala 
625 630 635 640 

Ala Thr Pro Lys Val Thr Thr Tyr Lys Pro Ala Leu Pro Ala Thr Ala 
645 650 ^55 

Lys Pro Lys Thr Ala Gly Leu Lys Leu Ala Gly Glu Ala Ser Thr Thr 
660 665 670 

Ser Thr Ser Glu Asn Gly Ala Ala Ser Asn Gly Asn Gly Asn Gly Ala 
675 680 685 

Ser Ala Ser Lys Thr Ser Ala Ala Lys Pro Leu Val Ser Ala Ala Thr 
690 695 700 



Arg Lys 
705 



Ser Ala 



<210> 4 

<211> 1953 

<212> DNA 

<213> artificial sequence 



<220> 

<223> fragment of the complete sequence of. cDNA coding 
for the GBSSI of Chlamydomonas reinhardtii and 
coding for the mature GBSSI protein 

<220> 

<221> CDS 

<222> (1) . . (1953) 

<223> 



<400> 4 

gcg ctg gac ate gtg atg gtt get get gag gtc gec cct tgg tec aag 48 

Ala Leu Asp lie Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 

15 10 15 

acg ggc ggc ctg ggc gat gtg act ggt ggc ctg cct att gag ctg gtc 96 

Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro lie Glu Leu Val 
20 25 30 



aag cgc ggc cac cgc gtc atg ace att gec cct cgc tac gac cag tac 
Lys Arg Gly His Arg Val Met Thr lie Ala Pro Arg Tyr Asp Gin Tyr 
35 40 45 



144 



get gac gec tgg gac ace teg gtg gtc gtg gac ate atg ggc gag aag 192 
Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp lie Met Gly Glu Lys 
50 55 60 

gtc cgc tac ttc cac tec ate aag aag ggc gtg cac cgc gtg tgg att 240 
Val Arg Tyr Phe His Ser lie Lys Lys Gly Val His Arg Val Trp lie r 
65 70 75 80 

gac cac ccc tgg ttc ctg gec aag gtc tgg ggc aag ace ggc tec aag 288 
Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 

ctg tac ggc ccc cgc tec ggc get gac tac ctg gac aac cac aag cgc 336 
Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 110 

ttc gee ctg ttc tgc aag gec get att gag get gee cgc gtg ctg ccc 384 
Phe Ala Leu Phe Cys Lys Ala Ala lie Glu Ala Ala Arg Val Leu Pro 
115 120 125 

ttc ggc ccc ggc gag gac tgc gtc ttc gtg gee aac gac tgg cac tec 432 
Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 140 

gec ctg gtg ccc gtc ctg ctg aag gac gag tac cag ccc aag ggc cag 480 
Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 160 

ttc acc aag gec aag teg gtg ctg get ate cac aac ate gec ttc cag 528 
Phe Thr Lys Ala Lys Ser Val Leu Ala lie His Asn He Ala Phe Gin 
165 170 175 



.9 



ggc cgc atg 
Gly Arg Met 



gcc ttt gac 
Ala Phe Asp 
195 

gag gcc acc 
Glu Ala Thr 
210 

acc tac aag 
Thr Tyr Lys 
225 

aag ctg gtg 
Lys Leu Val 



gcc gcc ggc 
Ala Ala Gly 



gag ggc att 
Glu Gly lie 
275 

gac aag ttc 
Asp Lys Phe 
290 

aag gcc gcc 
Lys Ala Ala 
305 

gac ccc acc 
Asp Pro Thr 



aag ggt gtg 
Lys Gly Val 



ccc aag gtg 
Pro Lys Val 
355 

aag ctg gtg 
Lys Leu Val 
370 

gtg gtc aag 
Val Val Lys 
385 

gac ttc atg 
Asp Phe Met 



tgg gag gag 
Trp Glu Glu 
180 

aag ctg gcc 
Lys Leu Ala 



ccc atg gag 
Pro Met Glu 



aag ate aac 
Lys lie Asn 
230 

act gtg teg 
Thr Val Ser 
245 

ggt gtg gag 
Gly Val Glu 
260 

gtg aac ggc 
Val Asn Gly 



ctg tct gcg 
Leu Ser Ala 



gcc aag gag 
Ala Lys Glu 
310 

gcc ccc ctg 
Ala Pro Leu 
325 

gac ate ate 
Asp lie lie 
340 

cag ate gcc 
Gin He Ala 



aac gcc ate 
Asn Ala He 



ttc teg gcg 
Phe Ser Ala 
390 

ctg gtg ccc 
Leu Val Pro 
405 



get ttc aag 
Ala Phe Lys 
185 

ttc teg gac 
Phe Ser Asp 
200 

gag gac gag 
Glu Asp Glu 
215 

tgg ctg aag 
Trp Leu Lys 



ccc aac tac 
Pro Asn Tyr 



ctg gac acc 
Leu Asp Thr 
265 

atg gac att 
Met Asp He 
280 

ccc tac gac 
Pro Tyr Asp 
295 

gcc ctg cag 
Ala Leu Gin 



ttc gcc ttc 
Phe Ala Phe 



ctg gcc gcc 
Leu Ala Ala 
345 

ate ctg ggt 
He Leu Gly 
360 

ggc acc aag 
Gly Thr Lys 
375 

ccc ctg gcg 
Pro Leu Ala 



teg cgc ttc 
Ser Arg Phe 



gac acg aag 
Asp Thr Lys 



ggc tat gcc 
Gly Tyr Ala 



aag ccc ccg 
Lys Pro Pro 
220 

ggt ggc att 
Gly Gly He 
235 

gcg acc gag 
Ala Thr Glu 
250 

gtc ate cgc 
Val He Arg 



gag gag tgg 
Glu Glu Trp 



cag aac age 
Gin Asn Ser 
300 

gcc gag ctg 
Ala Glu Leu 
315 

ate ggc cgc 
He Gly Arg 
330 

ctg ccc aag 
Leu Pro Lys 



acc ggc aag 
Thr Gly Lys 



tac aag ggc 
Tyr Lys Gly 
380 

cac atg etc 
His Met Leu 
395 

gag ccc tgc 
Glu Pro Cys 
410 



ctg ccc cag 
Leu Pro Gin 
190 

aag gtt tac 
Lys Val Tyr 
205 

ctg acg gga 
Leu Thr Gly 



ate gcc gcc 
He Ala Ala 



ate get gcc 
He Ala Ala 
255 

gcc aag ggc 
Ala Lys Gly 
270 

aac ccc aag 
Asn Pro Lys 
285 

gtc tac gcc 
Val Tyr Ala 



ggc ctg cct 
Gly Leu Pro 



ctg gag gag 
Leu Glu Glu 
335 

ate ctg gcc 
He Leu Ala 
350 

.t 

gcc gcc tac 
Ala Ala Tyr 
365 

cgc gcc aag 
Arg Ala Lys 



acc gcc ggc 
Thr Ala Gly 



ggc ctg ate 
Gly Leu He 
415 



gcc 576 
Ala 



act 624 
Thr 



aag 672 
Lys 



gac 720 

Asp 

240 

gat 768 
Asp 



att 816 
He 



acc 864 
Thr 



ggc 912 
Gly 



gtg 960 

Val 

320 

cag 1008 
Gin 



acc 1056 
Thr 



gag 1104 
Glu 



ggc 1152 
Gly 



gcc 1200 

Ala 

400 

cag 1248 
Gin 



10 

ctg cac gcc atg cac tac ggt acc gtg ccc gtg gta gcc tec acc ggc 1296 
Leu His Ala Met His Tyr Gly Thr Val Pro Val Val Ala Ser Thr Gly 
420 425 430 

ggc ctg gtc gac acc gtc aag gag ggc gtc acc ggc ttc cac atg ggc 134 4 

Gly Leu Val Asp Thr Val Lys Glu Gly Val Thr Gly Phe His Met Gly 
435 440 445 

gcc ctg aac ccc gac aag ctg gac gag get gac gcc gac gcc ctg gcc 1392 
Ala Leu Asn Pro Asp Lys Leu Asp Glu Ala Asp Ala Asp Ala Leu Ala 
450 455 460 

gcc acc gtg cgc cgt gcc age gag gtg ttt gcg ggc ggc cgc tac ccc 14 4 0 

Ala Thr Val Arg Arg Ala Ser Glu Val Phe Ala Gly Gly Arg Tyr Pro 
465 470 475 480 

gag atg gtg gcc aac tgc ate age cag gac ctg tec tgg tec aag ccc 1488 
Glu Met Val Ala Asn Cys lie Ser Gin Asp Leu Ser Trp Ser Lys Pro 
485 490 495 

gcc cag aag tgg gag ggc ctg ctg gag gag gtg gtg tac ggc aag ggc 1536 
Ala Gin Lys Trp Glu Gly Leu Leu Glu Glu Val Val Tyr Gly Lys Gly 
500 505 510 

ggc gtg gcc acc gcc aag aag gag gag ate aag gtg ccc gtt gcc gag 1584 
Gly Val Ala Thr Ala Lys Lys Glu Glu lie Lys Val Pro Val Ala Glu 
515 520 525 

aag ate ccc ggc gac ctg ccc gcc gtg tec tac gcc ccc aac acc ctg 1632 
Lys lie Pro Gly Asp Leu Pro Ala Val Ser Tyr Ala Pro Asn Thr Leu 
530 535 540 

aag ccc gtg tec gcc tec gtg gag ggc aac ggc gcc gcc gcg ccc aag 1680 
Lys Pro Val Ser Ala Ser Val Glu Gly Asn Gly Ala Ala Ala Pro Lys 
545 550 555 560 

gtc ggc acc acc gcc ccc gcc atg ggc gcg tgg cgc gcg acc acc ccc 1728 
Val Gly Thr Thr Ala Pro Ala Met Gly Ala Trp Arg Ala Thr Thr Pro 
565 570 575 

teg ggc ccc teg ccc gcc gcc gcc acc ccc aag gtg acc acc tac aag 177 6 

Ser Gly Pro Ser Pro Ala Ala Ala Thr Pro Lys Val Thr Thr Tyr Lys 
580 585 590 

ccc gcc ctg ccc gcc acc gcc aag ccc aag acc get ggc etc aag ctg 1824 
Pro Ala Leu Pro Ala Thr Ala Lys Pro Lys Thr Ala Gly Leu Lys Leu 
595 600 605 

gcc ggt gag gcc tec acc acc teg acc teg gag aac ggc get gcc tec 1872 
Ala Gly Glu Ala Ser Thr Thr Ser Thr Ser Glu Asn Gly Ala Ala Ser 
610 615 620 

aac ggc aac ggc aac ggt gcc teg gcc tec aag acc teg get gcc aag 1920 
Asn Gly Asn Gly Asn Gly Ala Ser Ala Ser Lys Thr Ser Ala Ala Lys 
625 630 635 640 

ccc ctg gtc tec gcc gcc acc cgc aag tec gcc 1953 
Pro Leu Val Ser Ala Ala Thr Arg Lys Ser Ala 
645 650 



<210> 5 
<211> 651 
<212> PRT 

<213> artificial sequence 
<220> 

<223> fragment of the complete sequence of cDNA coding 
for the GBSSI of Chlamydomonas reinhardtii and 
coding for the mature GBSSI protein 

<400> 5 

Ala Leu Asp He Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 
15 10 15 

Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro He Glu Leu Val 
20 25 30 

Lys Arg Gly His Arg Val Met Thr He Ala Pro Arg Tyr Asp Gin Tyr 
35 40 45 

Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp He Met Gly Glu Lys 
50 55 .60 

Val Arg Tyr Phe His Ser He Lys Lys Gly Val His Arg Val Trp He 
65 7 0 75 80 

Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 

Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 110 

Phe Ala Leu Phe Cys Lys Ala Ala He Glu Ala Ala Arg Val Leu Pro 
115 120 125 

Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 140 

Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
14 5 150 155 160 

Phe Thr Lys Ala Lys Ser Val Leu Ala lie His Asn He Ala Phe Gin 
165 170 175 

Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys Leu Pro Gin Ala 
180 185 190 

Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala Lys Val Tyr Thr 
195 200 205 

Glu Ala Thr Pro Met Glu Glu Asp Glu Lys Pro Pro Leu Thr Gly Lys 
210 215 220 

Thr Tyr Lys Lys He Asn Trp Leu Lys Gly Gly lie lie Ala Ala Asp 
225 230 235 240 

Lys Leu Val Thr Val Ser Pro Asn Tyr Ala Thr Glu lie Ala Ala Asp 
245 250 255 

Ala Ala Gly Gly Val Glu Leu Asp Thr Val lie Arg Ala Lys Gly lie 
260 265 270 



Glu Gly He Val 
275 

Asp Lys Phe Leu 
290 

Lys Ala Ala Ala 
305 

Asp Pro Thr Ala 



Lys Gly Val Asp 
340 

Pro Lys Val Gin 
355 

Lys Leu Val Asn 
370 

Val Val Lys Phe 
385 

Asp Phe Met Leu 



Leu His Ala Met 
420 

Gly Leu Val Asp 
435 

Ala Leu Asn Fro 
450 

Ala Thr Val Arg 
465 

Glu Met Val Ala 



Ala Gin Lys Trp 
500 

Gly Val Ala Thr 
515 

Lys He Pro Gly 
530 

Lys Pro Val Ser 
545 

Val Gly Thr Thr 



Ser Gly Pro Ser 
580 



Asn Gly Met Asp 
280 

Ser Ala Pro Tyr 
295 

Lys Glu Ala Leu 
310 

Pro Leu Phe Ala 
325 

He He Leu Ala 



He Ala He Leu 
360 

Ala He Gly Thr 
375 

Ser Ala Pro Leu 
390 

Val Pro Ser Arg 
405 

His Tyr Gly Thr 



Thr Val Lys Glu 
440 

Asp Lys Leu Asp 
455 

Arg Ala Ser Glu 
470 

Asn Cys He Ser 
485 

Glu Gly Leu Leu 



Ala Lys Lys Glu 
520 

Asp Leu Pro Ala 
535 

Ala Ser Val Glu 
550 

Ala Pro Ala Met 
565 

Pro Ala Ala Ala 



He Glu Glu Trp 



Asp Gin Asn Ser 
300 

Gin Ala Glu Leu 
315 

Phe He Gly Arg 
330 

Ala Leu Pro Lys 
345 

Gly Thr Gly Lys 



Lys Tyr Lys Gly 
380 

Ala His Met Leu 
395 

Phe Glu Pro Cys 
410 

Val Pro Val Val 
425 

Gly Val Thr Gly 



Glu Ala Asp Ala 
460 

Val Phe Ala Gly 
475 

Gin Asp Leu Ser 
490 

Glu Glu Val Val 
505 

Glu He Lys Val 



Val Ser Tyr Ala 
540 

Gly Asn Gly Ala 
555 

Gly Ala Trp Arg 
570 

Thr Pro Lys Val 
585 



Asn Pro Lys Thr 
285 

Val Tyr Ala Gly 



Gly Leu Pro Val 
320 

Leu Glu Glu Gin 
335 

lie Leu Ala Thr 
350 

Ala Ala Tyr Glu 
365 

Arg Ala Lys Gly 



Thr Ala Gly Ala 
400 

Gly Leu lie Gin 
415 

Ala Ser Thr Gly 
430 

Phe His Met Gly 
445 

Asp Ala Leu Ala 



Gly Arg Tyr Pro 
480 

Trp Ser Lys Pro 
495 

Tyr Gly I^ys Gly 
510 1 

Pro Val Ala Glu 
525 

Pro Asn Thr Leu 



Ala Ala Pro Lys 
560 

Ala Thr Thr Pro 
575 

Thr Thr Tyr Lys 
590 



Pro Ala Leu Pro 
595 

Ala Gly Glu Ala 
610 

Asn Gly Asn Gly 
625 

Pro Leu Val Ser 



Ala Thr Ala Lys 
600 

Ser Thr Thr Ser 
615 

Asn Gly Ala Ser 
630 

Ala Ala Thr Arg 

645 
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Pro Lys Thr Ala 



Thr Ser Glu Asn 
620 

Ala Ser Lys Thr 
635 

Lys Ser Ala 
650 




Gly Leu Lys Leu 
605 

Gly Ala Ala Ser 

Ser Ala Ala Lys 
640 



<210> 6 

<211> 1314 

<212> DNA 

<213> artificial sequence 
<220> 

<223> fragment of the complete cDNA coding for 
the GBSSI of Chlamydomonas reinhardtii 

<220> 

<221> CDS 

<222> (1) . . (1314) 
<223> 

<4O0> 6 

gcg ctg gac ate gtg 
Ala Leu Asp lie Val 
1 5 



atg gtt get get gag gtc gec cct tgg tec aag 
Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 
10 15 



acg ggc ggc ctg ggc gat gtg act ggt ggc ctg cct att gag ctg gtc 
Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro lie Glu Leu Val 
20 25 30 

aag cgc ggc cac cgc gtc atg acc att gec cct cgc tac gac cag tac 
Lys Arg Gly His Arg Val Met Thr lie Ala Pro Arg Tyr Asp Gin Tyr 
35 40 45 

get gac gec tgg gac acc teg gtg gtc gtg gac ate atg ggc gag aag 
Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp lie Met Gly Glu Lys 
50 55 60 

gtc cgc tac ttc cac tec ate aag aag ggc gtg cac cgc gtg tgg att 
Val Arg Tyr Phe His Ser lie Lys Lys Gly Val His Arg Val Trp lie 
65 70 75 80 

gac cac ccc tgg ttc ctg gee aag gtc tgg ggc aag acc ggc tec aag 
Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 



ctg tac ggc ccc cgc tec ggc get gac tac ctg gac aac cac aag cgc 

Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 110 

ttc gee ctg ttc tgc aag gec get att gag get gee cgc gtg ctg ccc 

Phe Ala Leu Phe Cys Lys Ala Ala lie Glu Ala Ala Arg Val Leu Pro 
115 120 125 



ttc ggc ccc 
Phe Gly Pro 
130 

gcc ctg gtg 
Ala Leu Val 
145 

ttc acc aag 
Phe Thr Lys 



ggc cgc atg 
Gly Arg Met 



gcc ttt gac 
Ala Phe Asp 
195 

gag gcc acc 
Glu Ala Thr 
210 

acc tac aag 
Thr Tyr Lys 
225 

aag ctg gtg 
Lys Leu Val 



gcc gcc ggc 
Ala Ala Gly 



gag ggc att 
Glu Gly lie 
275 

gac aag ttc 
Asp Lys Phe 
290 

aag gcc gcc 
Lys Ala Ala 
305 

gac ccc acc 
Asp Pro Thr 



aag ggt gtg 
Lys Gly Val 



ccc aag gtg 
Pro Lys Val 
355 



ggc gag gac 
Gly Glu Asp 



ccc gtc ctg 
Pro Val Leu 
150 

gcc aag teg 
Ala Lys Ser 
165 

tgg gag gag 
Trp Glu Glu 
180 

aag ctg gcc 
Lys Leu Ala 



ccc atg gag 
Pro Met Glu 



aag ate aac 
Lys lie Asn 
230 

act gtg teg 
Thr Val Ser 
245 

ggt gtg gag 
Gly Val Glu 
260 

gtg aac ggc 
Val Asn Gly 



ctg tct gcg 
Leu Ser Ala 



gcc aag gag 
Ala Lys Glu 
310 

gcc ccc ctg 
Ala Pro Leu 
325 

gac ate ate 
Asp lie lie 
340 

cag ate gcc 
Gin He Ala 



tgc gtc ttc 
Cys Val Phe 
135 

ctg aag gac 
Leu Lys Asp 



gtg ctg get 
Val Leu Ala 



get ttc aag 
Ala Phe Lys 
185 

ttc teg gac 
Phe Ser Asp 
200 

gag gac gag 
Glu Asp Glu 
215 

tgg ctg aag 
Trp Leu Lys 



ccc aac tac 
Pro Asn Tyr 



ctg gac acc 
Leu Asp Thr 
265 

atg gac att 
Met Asp He 
280 

ccc tac gac 
Pro Tyr Asp 
295 

gcc ctg cag 
Ala Leu Gin 



ttc gcc ttc 
Phe Ala Phe 



ctg gcc gcc 
Leu Ala Ala 
345 

ate ctg ggt 
He Leu Gly 
360 



gtg gcc aac 
Val Ala Asn 
140 

gag tac cag 
Glu Tyr Gin 
155 

ate cac aac 
He His Asn 
170 

gac acg aag 
Asp Thr Lys 



ggc tat gcc 
Gly Tyr Ala 



aag ccc ccg 
Lys Pro Pro 
220 

ggt ggc att 
Gly Gly He 
235 

gcg acc gag 
Ala Thr Glu 
250 

gtc ate cgc 
Val He Arg 



gag gag tgg 
Glu Glu Trp 



cag aac age 
Gin Asn Ser 
300 

gcc gag ctg 
Ala Glu Leu 
315 

ate ggc cgc 
He Gly Arg 
330 

ctg ccc aag 
Leu Pro Lys 



acc ggc aag 
Thr Gly Lys 



gac tgg cac 
Asp Trp His 



ccc aag ggc 
Pro Lys Gly 



ate gcc ttc 
lie Ala Phe 
175 

ctg ccc cag 
Leu Pro Gin 
190 

aag gtt tac 
Lys Val Tyr 
205 

ctg acg gga 
Leu Thr Gly 



ate gcc gcc 
He Ala Ala 



ate get gcc 
He Ala Ala 
255 

gcc aag ggc 
Ala Lys Gly 
270 

aac ccc aag 
Asn Pro Lys 
285 

gtc tac gcc 
Val Tyr Ala 



ggc ctg cct 
Gly Leu Pro 



ctg gag gag 
Leu Glu Glu 
335 

ate ctg gcc 
He Leu Ala 
350 

gcc gcc tac 
Ala Ala Tyr 
365 



tec 432 
Ser 



cag 4 80 

Gin 

160 

cag 528 
Gin 



gcc 576 
Ala 



act 624 
Thr 



aag 672 
Lys 



gac 720 

Asp 

240 

gat 768 
Asp 



att 816 
He 



acc 864 
Thr 



ggc 912 
Gly 



gtg 960 

Val 

320 

cag 1008 
Gin 



acc 1056 
Thr 



gag 1104 
Glu 
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aag ctg gtg aac gcc ate ggc acc aag tac aag ggc cgc gec aag ggc 1152 
Lys Leu Val Asn Ala lie Gly Thr Lys Tyr Lys Gly Arg Ala Lys Gly 
370 375 380 

gtg gtc aag ttc teg gcg ccc ctg gcg cac atg etc acc gcc ggc gcc 1200 
Val Val Lys Phe Ser Ala Pro Leu Ala His Met Leu Thr Ala Gly Ala 
385 390 395 400 

gac ttc atg ctg gtg ccc teg cgc ttc gag ccc tgc ggc ctg ate cag 1248 
Asp Phe Met Leu Val Pro Ser Arg Phe Glu Pro Cys Gly Leu lie Gin 
405 410 415 

ctg cac gcc atg cac tac ggt acc gtg ccc gtg gta gcc tec acc ggc 1296 
Leu His Ala Met His Tyr Gly Thr Val Pro Val Val Ala Ser Thr Gly 
420 425 430 

ggc ctg gtc gac acc gtc 1314 
Gly Leu Val Asp Thr Val 
435 



<210> 7 

<211> 438 

<212> PRT 

<213> artificial sequence 
<220> 

<223> fragment of the complete cDNA coding 

for the GBSSI of Chlamydomonas reinhardtii 

<400> 7 

Ala Leu Asp lie Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 
15 10 15 

Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro lie Glu Leu Val 
20 25 30 

Lys Arg Gly His Arg Val Met Thr lie Ala Pro Arg Tyr Asp Gin Tyr 
35 40 45 

Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp lie Met Gly Glu Lys 
50 55 60 

Val Arg Tyr Phe His Ser lie Lys Lys Gly Val His Arg Val irp lie 
65 70 75 80 

Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 

Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 110 

Phe Ala Leu Phe Cys Lys Ala Ala lie Glu Ala Ala Arg Val Leu Pro 
115 120 125 

Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 140 



Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 160 
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Phe Thr Lys Ala Lys Ser Val Leu Ala lie His Asn lie Ala Phe Gin 
165 170 175 

Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys Leu Pro Gin Ala 
180 185 190 

Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala Lys Val Tyr Thr 
195 200 205 

Glu Ala Thr Pro Met Glu Glu Asp Glu Lys Pro Pro Leu Thr Gly Lys 
210 215 220 

Thr Tyr Lys Lys lie Asn Trp Leu Lys Gly Gly lie lie Ala Ala Asp 
225 230 235 240 

Lys Leu Val Thr Val Ser Pro Asn Tyr Ala Thr Glu lie Ala Ala Asp 
245 250 255 

Ala Ala Gly Gly Val Glu Leu Asp Thr Val lie Arg Ala Lys Gly lie 
260 265 270 

Glu Gly lie Val Asn Gly Met Asp lie Glu Glu Trp Asn Pro Lys Thr 
275 280 285 

Asp Lys Phe Leu Ser Ala Pro Tyr Asp Gin Asn Ser Val Tyr Ala Gly 
290 295 300 

Lys Ala Ala Ala Lys Glu Ala Leu Gin Ala Glu Leu Gly Leu Pro Val 
305 310 315 320 

Asp Pro Thr Ala Pro Leu Phe Ala Phe lie Gly Arg Leu Glu Glu Gin 
325 330 335 

Lys Gly Val Asp lie lie Leu Ala Ala Leu Pro Lys lie Leu Ala Thr 
340 345 350 

Pro Lys Val Gin lie Ala lie Leu Gly Thr Gly Lys Ala Ala Tyr Glu 
355 360 365 

Lys Leu Val Asn Ala lie Gly Thr Lys Tyr Lys Gly Arg Ala Lys Gly 
370 375 380 

Val Val Lys Phe Ser Ala Pro Leu Ala His Met Leu Thr Ala Gly Ala 
385 390 395 ? 400 

Asp Phe Met Leu Val Pro Ser Arg Phe Glu Pro Cys Gly Leu lie Gin 
405 410 415 

Leu His Ala Met His Tyr Gly Thr Val Pro Val Val Ala Ser Thr Gly 
420 425 430 

Gly Leu Val Asp Thr Val 



435 



<210> 
<211> 
<212> 
<213> 



artificial sequence 



8 

1593 
DNA 
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<220> 

<223> fragment of the complete cDNA coding for 
the GBSSI of Chlamydomonas reinhardtii 

<220> 

<221> CDS 

<222> (1) . . (1593) 

<223> 

<400> 8 

gcg ctg gac ate gtg atg gtt get get gag gtc gee cct tgg tec aag 48 

Ala Leu Asp lie Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 

15 10 15 

acg ggc ggc ctg ggc gat gtg act ggt ggc ctg cct att gag ctg gtc 96 
Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro lie Glu Leu Val 
20 25 30 

aag cgc ggc cac cgc gtc atg ace att gec cct cgc tac gac cag tac 144 
Lys Arg Gly His Arg Val Met Thr lie Ala Pro Arg Tyr Asp Gin Tyr 
35 40 45 

get gac gec tgg gac acc teg gtg gtc gtg gac ate atg ggc gag aag 192 
Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp lie Met Gly Glu Lys 
50 55 60 

gtc cgc tac ttc cac tec ate aag aag ggc gtg cac cgc gtg tgg att 240 
Val Arg Tyr Phe His Ser lie Lys Lys Gly Val His Arg Val Trp lie 
65 70 75 80 

gac cac ccc tgg ttc ctg gee aag gtc tgg ggc aag acc ggc tec aag 288 
Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 

ctg tac ggc ccc cgc tec ggc get gac tac ctg gac aac cac aag cgc 336 
Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 110 

ttc gec ctg ttc tgc aag gec get att gag get gec cgc gtg ctg ccc 384 
Phe Ala Leu Phe Cys Lys Ala Ala lie Glu Ala Ala Arg Val Leu Pro 
115 120 125 

ttc ggc ccc ggc gag gac tgc gtc ttc gtg gee aac gac tgg cac tec 432 
Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp riis Ser 
130 135 140 

gec ctg gtg ccc gtc ctg ctg aag gac gag tac cag ccc aag ggc cag 480 
Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 160 

ttc acc aag gee aag teg gtg ctg get ate cac aac ate gec ttc cag 528 
Phe Thr Lys Ala Lys Ser Val Leu Ala lie His Asn He Ala Phe Gin 
165 170 175 

ggc cgc atg tgg gag gag get ttc aag gac acg aag ctg ccc cag gee 576 
Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys Leu Pro Gin Ala 
180 185 190 



gec ttt gac aag ctg gee ttc teg gac ggc tat gee aag gtt tac act 
Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala Lys Val Tyr Thr 
195 200 205 



624 
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gag gcc acc ccc atg gag gag gac gag aag ccc ccg ctg acg gga aag 672 
Glu Ala Thr Pro Met Glu Glu Asp Glu Lys Pro Pro Leu Thr Gly Lys 
210 215 220 

acc tac aag aag ate aac tgg ctg aag ggt ggc att ate gcc gcc gac 720 
Thr Tyr Lys Lys lie Asn Trp Leu Lys Gly Gly lie lie Ala Ala Asp 
225 230 235 240 

aag ctg gtg act gtg teg ccc aac tac gcg acc gag ate get gcc gat 7 68 

Lys Leu Val Thr Val Ser Pro Asn Tyr Ala Thr Glu lie Ala Ala Asp 
245 250 255 

gcc gcc ggc ggt gtg gag ctg gac acc gtc ate cgc gcc aag ggc att 816 
Ala Ala Gly Gly Val Glu Leu Asp Thr Val lie Arg Ala Lys Gly lie 
260 265 270 

gag ggc att gtg aac ggc atg gac att gag gag tgg aac ccc aag acc 864 
Glu Gly lie Val Asn Gly Met Asp lie Glu Glu Trp Asn Pro Lys Thr 
275 280 285 

gac aag ttc ctg tct gcg ccc tac gac cag aac age gtc tac gcc ggc 912 
Asp Lys Phe Leu Ser Ala Pro Tyr Asp Gin Asn Ser Val Tyr Ala Gly 
290 295 300 

aag gcc gcc gcc aag gag gcc ctg cag gcc gag ctg ggc ctg cct gtg 960 
Lys Ala Ala Ala Lys Glu Ala Leu Gin Ala Glu Leu Gly Leu Pro Val 
305 310 315 320 

gac ccc acc gcc ccc ctg ttc gcc ttc ate ggc cgc ctg gag gag cag 1008 
Asp Pro Thr Ala Pro Leu Phe Ala Phe lie Gly Arg Leu Glu Glu Gin 
325 330 335 

aag ggt gtg gac ate ate ctg gcc gcc ctg ccc aag ate ctg gcc acc 1056 
Lys Gly Val Asp lie lie Leu Ala Ala Leu Pro Lys lie Leu Ala Thr 
340 345 350 

ccc aag gtg cag ate gcc ate ctg ggt acc ggc aag gcc gcc tac gag 1104 
Pro Lys Val Gin lie Ala lie Leu Gly Thr Gly Lys Ala Ala Tyr Glu 
355 360 365 

aag ctg gtg aac gcc ate ggc acc aag tac aag ggc cgc gcc aag ggc 1152 
Lys Leu Val Asn Ala lie Gly Thr Lys Tyr Lys Gly Arg Ala Lys Gly 
370 375 380 ' 

gtg gtc aag ttc teg gcg ccc ctg gcg cac atg etc acc gcc ggc gcc 1200 
Val Val Lys Phe Ser Ala Pro Leu Ala His Met Leu Thr Ala Gly Ala 
385 390 395 400 

gac ttc atg ctg gtg ccc teg cgc ttc gag ccc tgc ggc ctg ate cag 1248 
Asp Phe Met Leu Val Pro Ser Arg Phe Glu Pro Cys Gly Leu lie Gin 
405 410 415 

ctg cac gcc atg cac tac ggt acc gtg ccc gtg gta gcc tec acc ggc 1296 
Leu His Ala Met His Tyr Gly Thr Val Pro Val Val Ala Ser Thr Gly 
420 425 430 

ggc ctg gtc gac acc gtc aag gag ggc gtc acc ggc ttc cac atg ggc 1344 
Gly Leu Val Asp Thr Val Lys Glu Gly Val Thr Gly Phe His Met Gly 
435 440 445 
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gcc ctg aac ccc gac aag ctg gac gag get gac gec gac gec ctg gec 1392 
Ala Leu Asn Pro Asp Lys Leu Asp Glu Ala Asp Ala Asp Ala Leu Ala 
450 455 460 

gcc acc gtg cgc cgt gcc age gag gtg ttt gcg ggc ggc cgc tac ccc 1440 
Ala Thr Val Arg Arg Ala Ser Glu Val Phe Ala Gly Gly Arg Tyr Pro 
465 470 475 480 

gag atg gtg gcc aac tgc ate age cag gac ctg tec tgg tec aag ccc 1488 
Glu Met Val Ala Asn Cys lie Ser Gin Asp Leu Ser Trp Ser Lys Pro 
485 490 495 

gcc cag aag tgg gag ggc ctg ctg gag gag gtg gtg tac ggc aag ggc 1536 
Ala Gin Lys Trp Glu Gly Leu Leu Glu Glu Val Val Tyr Gly Lys Gly 
500 505 510 

ggc gtg gcc acc gcc aag aag gag gag ate aag gtg ccc gtt gcc gag 1584 
Gly Val Ala Thr Ala Lys Lys Glu Glu lie Lys Val Pro Val Ala Glu 
515 520 525 

aag ate ccc 1593 
Lys lie Pro 
530 



<210> 9 

<211> 531 

<212> PRT 

<213> artificial sequence 
<220> 

<223> fragment of the complete cDNA coding for 
the GBSSI of Chlamyciomonas reinhardtii 

<400> 9 

Ala Leu Asp lie Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 
15 10 15 

Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro lie Glu Leu Val 
20 25 30 

Lys Arg Gly His Arg Val Met Thr lie Ala Pro Arg Tyr Asp Gin Tyr 
35 .40 45 

Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp lie Met Gly Glu Lys 
50 55 60 

Val Arg Tyr Phe His Ser lie Lys Lys Gly Val His Arg Val Trp lie 
65 70 75 80 

Asp His Pro Trp. Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 

Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 110 

Phe Ala Leu Phe Cys Lys Ala Ala lie Glu Ala Ala Arg Val Leu Pro 
115 120 125 



Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 140 
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Ala Leu Val Pro 
145 

Phe Thr Lys Ala 



Gly Arg Met Trp 
180 

Ala Phe Asp Lys 
195 

Glu Ala Thr Pro 
210 

Thr Tyr Lys Lys 
225 

Lys Leu Val Thr 



Ala Ala Gly Gly 
260 

Glu Gly lie Val 
275 

Asp Lys Phe Leu 
290 

Lys Ala Ala Ala 
305 

Asp Pro Thr Ala 



Lys Gly Val Asp 
340 

Pro Lys Val Gin 
355 

Lys Leu Val Asn 
370 

Val Val Lys Phe 
385 

Asp Phe Met Leu 



Leu His Ala Met 
420 

Gly Leu Val Asp 
435 

Ala Leu Asn Pro 
450 



Val Leu Leu Lys 
150 

Lys Ser Val Leu 
165 

Glu Glu. Ala Phe 



Leu Ala Phe Ser 
200 

Met Glu Glu Asp 
215 

lie Asn Trp Leu 
230 

Val Ser Pro Asn 
245 

Val Glu Leu Asp 



Asn Gly Met Asp 
280 

Ser Ala Pro Tyr 
295 

Lys Glu Ala Leu 
310 

Pro Leu Phe Ala 
325 

lie lie Leu Ala 



lie Ala lie Leu 
360 

Ala He Gly Thr 
375 

Ser Ala Pro Leu 
390 

Val Pro Ser Arg 
405 

His Tyr Gly Thr 



Thr Val Lys Glu 
440 

Asp Lys Leu Asp 
455 



Asp Glu Tyr Gin 
155 

Ala He His Asn 
170 

Lys Asp Thr Lys 
185 

Asp Gly Tyr Ala 



Glu Lys Pro Pro 
220 

Lys Gly Gly He 
235 

Tyr Ala Thr Glu 
250 

Thr Val He Arg 
265 

lie Glu Glu Trp 



Asp Gin Asn Ser 
300 

Gin Ala Glu Leu 
315 

Phe He Gly Arg 
330 

Ala Leu Pro Lys 
345 

Gly Thr Gly Lys 



Lys Tyr Lys Gly 
380 

Ala His Met Leu 
395 

Phe Glu Pro Cys 
410 

Val Pro Val Val 
425 

Gly Val Thr Gly 



Glu Ala Asp Ala 
460 



Pro Lys Gly Gin 
160 

He Ala Phe Gin 
175 

Leu Pro Gin Ala 
190 

Lys Val Tyr Thr 
205 

Leu Thr Gly Lys 



He Ala Ala Asp 
240 

He Ala Ala Asp 
255 

Ala Lys Gly He 
270 

Asn Pro Lys Thr 
285 

Val Tyr Ala Gly 



Gly Leu Pro Val 
320 

Leu Glu Glu Gin 
335 

He Leu Ala Thr 
350 

Ala Ala Tyr Glu 
365 

Arg Ala Lys Gly 



Thr Ala Gly Ala 
400 

Gly Leu He Gin 
415 

Ala Ser Thr Gly 
430 

Phe His Met Gly 
445 

Asp Ala Leu Ala 
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Ala Thr Val Arg 
465 

Glu Met Val Ala 



Ala Gin Lys Trp 
500 

Gly Val Ala Thr 
515 



Arg Ala Ser Glu 
470 

Asn Cys lie Ser 
485 

Glu Gly Leu Leu 



Ala Lys Lys Glu 
520 



Val Phe Ala Gly 
475 

Gin Asp Leu Ser 
490 

Glu Glu Val Val 
505 

Glu lie Lys Val 



Gly Arg Tyr Pro 
480 

Trp Ser Lys Pro 
495 

Tyr Gly Lys Gly 
510 

Pro Val Ala Glu 
525 



Lys lie Pro 
530 



